Participation in Language Resource Development and Sharing

Author

  • Virach Sornlertlamvanich
Abstract

Language resources are essential for understanding and modeling language in present-day approaches. A language with rich language resources benefits greatly and can make large advances in language processing. A less-resourced language, on the other hand, struggles to prepare sufficiently large language resources such as raw text or annotated corpora, which is a labor-intensive and time-consuming task. Moreover, computerizing the text is another non-trivial effort: it needs a supportive computing environment for inputting, encoding, retrieving, analyzing, etc. Learning from the resource-rich languages, we are gradually collecting resources and preparing the necessary tools. Through many efforts in recent years, we can see significant outcomes from the PAN Localization project (2004-2007, 2007-2010, http://www.panl10n.net/), ADD (2006-2010, http://www.tcllab.org/), Asian WordNet (http://asianwordnet.org/), Hindi WordNet (http://www.cfilt.iitb.ac.in/wordnet/webhwn/), BEST (Thai Word Segmentation Software Contest, since 2009, http://thailang.nectec.or.th/best/) and many NLP summer schools. These activities have great potential to advance both NLP tool development and the development of research personnel, and they have led to strong growth in Asian language resource development and research. With the spirit of sharing on social networks, resources can be developed efficiently to a satisfactory amount within a reasonable time scale. Asian WordNet is an example: a set of WordNets for 13 languages, connected via Princeton WordNet. Thai WordNet is open for online collaborative development, and about 70K synsets and 80K words of Thai WordNet are available online. Thai-Lao conversion is an approach that demonstrates the advantage of exploiting language similarity to build up resources for another language: Lao WordNet is created by converting Thai WordNet using a phoneme transfer approach.
By taking advantage of language similarity, a language corpus can be obtained through quick conversion rules. In this case, direct transfer is much more efficient than creating the resource from scratch. Currently, most of the above-mentioned results are open to the public, at least for research purposes. However, more and more language resources are still needed to improve language processing. The possibility of online collaborative development and sharing is a key factor in language resource development.

Similar resources

Experiences with Shared Resources for Research and Education in Speech and Language Processing

Resource barriers can prevent capable researchers from participating in the speech and language community and can make it difficult to support learning and participation in our field at a wide variety of institutions. Sharing resources, whether software, processed data, experimental methodologies or virtual machines, can reduce the barrier to entry and potentially broaden participation in speec...

Improving Knowledge Sharing, in Research and Development Organizations (Investigating the Role of Culture, Justice and Trust)

Nowadays the effective use of organizational knowledge and its sharing among staff has become a strategic resource for achieving and maintaining competitive advantage. To achieve this, organizations need strategies to improve knowledge sharing. The purpose of this study was to investigate the relationship between organizational justice and trust, culture and trust and knowledge sharing ...

Proposing a Model of Co-Creative Participation in Tourism Market

Objective: There is a growing interest in customers' innovation in the realm of tourism studies. In the new global business ecosystem, where individuals, organizations, governments and the economy work together as an integrated network, we need a new innovation model. The model should be set at a level in which internal, external, cooperative and co-creative ideas can converge to create organiza...

Examine the Relationship between Types Human Resource Strategies on Employee Participation (Case Study: National Iranian Oil Products Distribution Lorestan Region)

Today, staff participation in the organization is one of the most basic ways to improve the administrative and managerial efficiency and effectiveness of countries, organizations and companies. Given the increasing importance of human resources in organizations, this study investigated the relationship between human resource strategies and the participation of employees in the Natio...

Node Participation in Ad Hoc and Peer-to-Peer Networks: A Game-Theoretic Formulation

Ad hoc and peer-to-peer networks sometimes operate as voluntary resource sharing networks, relying on users’ willingness to spend their own resources for the common good. As the costs of such resource sharing (what we call “node participation”) outweigh the benefits perceived by the nodes, users are less likely to participate, compromising overall network goals. The contribution of this paper i...

Publication date: 2011